Rejoinder: Struggles with Survey Weighting and Regression Modeling

نویسنده

  • Andrew Gelman
چکیده

Encountering this sort of statement in the documentation of opinion poll data we were analyzing in political science: “A weight is assigned to each sample record, and MUST be used for all tabulations.” (This particular version was in the codebook for the 1988 CBS News/New York Times Poll; as you can see, this is a problem that has been bugging me for a long time.) Computing weighted averages is fine, but weighted regression is a little more tricky—I do not really know what a weighted logistic regression likelihood, for example, is supposed to represent. Constructing the weighting for the New York City Social Indicators Survey (SIS). It quickly became clear that we had to make many arbitrary choices about inclusion and smoothing of weighting variables, and we could not find any good general guidelines. We wanted to estimate state-level public opinion from national polls. If our surveys were simple random samples, this would be basic Bayes hierarchical modeling (with 50 groups, possibly linked using state-level predictors). Actually, though, the surveys suffer differential nonresponse (lower response by men, younger people, ethnic minorities, etc.) as signaled to the user (such as myself) via a vector of weights.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comment: Struggles with Survey Weighting and Regression Modeling

Andrew Gelman’s article “Struggles with survey weighting and regression modeling” addresses the question of what approach analysts should use to produce estimates (and associated estimates of variability) based on sample survey data. Gelman starts by asserting that survey weighting is a “mess.” While we agree that incorporation of the survey design for regression remains challenging, with impor...

متن کامل

Struggles with Survey Weighting and Regression Modeling

The general principles of Bayesian data analysis imply that models for survey responses should be constructed conditional on all variables that affect the probability of inclusion and nonresponse, which are also the variables used in survey weighting and clustering. However, such models can quickly become very complicated, with potentially thousands of post-stratification cells. It is then a ch...

متن کامل

Comment: Struggles with Survey Weighting and Regression Modeling

I appreciate the opportunity to comment on Andrew Gelman’s interesting paper. As an admirer of Gelman’s work, it is a pleasure to read his take on the topic of survey weighting, which I have always found fascinating. Since I support Gelman’s general approach, I focus on reinforcing some points in the article and commenting on some of the modeling issues he raises. As a student of statistics, I ...

متن کامل

Comment: Struggles with Survey Weighting and Regression Modeling

In the ideal samples of survey sampling textbooks, weights are the inverses of the inclusion probabilities for the units. But nonresponse and undercoverage occur, and survey statisticians try to compensate for the resulting bias by adjusting the sampling weights. There has been much debate about when and whether weights should be used in analyses, and how they should be constructed. Professor G...

متن کامل

Comment: Struggles with Survey Weighting and Regression Modeling

This is an intriguing paper that raises important questions, and I feel privileged for being invited to discuss it. The paper deals with a very basic problem of sample surveys: how to weight the survey data in order to estimate finite population quantities of interest like means, differences of means or regression coefficients. The paper focuses for the most part on the common estimator of a po...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007